Combining Mllr Adaptation and Feature Extraction for Robust Speech Recognition in Reverberant Environments

نویسندگان

Aik Ming Toh

Roberto Togneri

Sven Nordholm

چکیده

This paper presents an investigation on speech recognition performance in reverberant environments. Reverberant noise has been a major concern in speech recognition systems. Many speech recognition systems, even with state-of-art features, fail to respond to reverberant effects and the recognition rate deteriorates. This shows the limitations of robust feature extraction in reverberant environment. The maximum likelihood linear regression (MLLR) adaptation scheme is adopted for reverberant speech recognition on the TI-DIGIT database. The use of adaptation data improved the recognition performance significantly especially for strong reverberations. The performance of both MFCC 0 and MFCC 0 D A features improved by more than 10% for reverberations greater than 0.4s. This paper also demonstrates the optimal strength of both robust feature extraction and adaptation scheme for reverberant speech recognition. The recognition performance is maintained above 90% up to reverberation time 0.5s using both schemes.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation

A variety of methods are used for speaker adaptation in speech recognition. In some techniques, such as MAP estimation, only the models with available training data are updated. Hence, large amounts of training data are required in order to have significant recognition improvements. In some others, such as MLLR, where several general transformations are applied to model clusters, the results ar...

متن کامل

Speaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation

متن کامل

Robust Asr in Reverberant Environments Using Temporal Cepstrum Smoothing for Speech Enhancement and an Amplitude Modulation Filterbank for Feature Extraction

This paper presents techniques aiming at improving automatic speech recognition (ASR) in single channel scenarios in the context of the REVERB (REverberant Voice Enhancement and Recognition Benchmark) challenge. System improvements range from speech enhancement over robust feature extraction to model adaptation and word-based integration of multiple classifiers. The selective temporal cepstrum ...

متن کامل

Fast Adaptation for Robust Speech Recognition in Reverberant Environments

We present a fast method, i.e. requiring little data, for adapting a hybrid Hidden Markov Model / Multi Layer Perceptron speech recognizer to reverberant environments. Adaptation is performed by a linear transformation of the acoustic feature space. A dimensionality reduction technique similar to the eigenvoice approach is also investigated. A pool of adaptation transformations are estimated a ...

متن کامل

Noise adaptation for robust AURORA 2 noisy digit recognition using statistical data mapping

The mismatch between system training and operating conditions often has negative influences on automatic speech recognition (ASR) systems. Noise in the operating environments is commonly encountered. ASR model adaptation is an important way to enhance the system performance in noisy environments. This paper proposes a feature-based statistical data mapping (SDM) approach for robust noisy digit ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2006

Combining Mllr Adaptation and Feature Extraction for Robust Speech Recognition in Reverberant Environments

نویسندگان

چکیده

منابع مشابه

Speaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation

Speaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation

Robust Asr in Reverberant Environments Using Temporal Cepstrum Smoothing for Speech Enhancement and an Amplitude Modulation Filterbank for Feature Extraction

Fast Adaptation for Robust Speech Recognition in Reverberant Environments

Noise adaptation for robust AURORA 2 noisy digit recognition using statistical data mapping

عنوان ژورنال:

اشتراک گذاری